Purified nucleocapsid protein (N protein) from transmissible gastroenteritis virus (TGEV) enhanced
hammerhead ribozyme self-cleavage and favored nucleic acid annealing, properties that define RNA chaperones, as
previously reported. Several TGEV N-protein deletion mutants were expressed in Escherichia coli and purified,
and their RNA binding ability and RNA chaperone activity were evaluated. The smallest N-protein domain
analyzed with RNA chaperone activity, facilitating DNA and RNA annealing, contained the central
unstructured region (amino acids 117 to 268). Interestingly, N protein and its deletion mutants with RNA chaperone
activity enhanced template switching in a retrovirus-derived heterologous system, reinforcing the concept that
TGEV N protein is an RNA chaperone that could be involved in template switching. This result is in agreement
with the observation that in vivo, N protein is not necessary for TGEV replication, but it is required for efficient
transcription.


RNA chaperones are proteins that allow the proper folding 
of nucleic acids (4, 12). Several characteristics distinguish RNA
chaperones from other RNA binding proteins: they lack specificity,
they are needed only transiently, and their function has no energy
requirement (19). RNA chaperone activity
cannot be predicted based on the protein domain structure or 
the existence of discrete motifs. In the case of virus-encoded 
RNA chaperones, even RNA chaperone proteins of the same 
viral genus have little sequence similarity. Nevertheless, RNA 
chaperones have the highest frequency of disordered regions, 
and it has been proposed that they act according to an entropy 
transfer model, allowing correct RNA folding by successive 
cycles of protein-substrate order-disorder (5, 32). In fact, pro- 
tein disorder has been the selected criterion for investigating 
the chaperone activity of candidate viral RNA chaperone pro- 
teins (14, 36). The list of virus-encoded RNA chaperones has 
been quickly growing. Nevertheless, their role in the viral life 
cycle as RNA chaperones is still unclear, mainly due to the 
difficulty of analyzing in vivo the chaperone activity of these 
proteins in viral infection. 

Coronaviruses are enveloped viruses of the Coronaviridae 
family, included in the Nidovirales order (7, 10). Their genomes 
are positive-sense, single-stranded RNAs (ssRNAs) of around 
30 kb, the largest known viral RNA genomes. The 5’ two-thirds 
of the genomic RNA encodes the replicase proteins. The 3’ 
third of the genome encodes structural and nonstructural pro- 
teins. Coronavirus transcription leads to a nested set of sub- 
genomic (sg) mRNAs that are generated by a discontinuous 
mechanism. This process involves base pairing of nascent RNAs
of negative polarity, synthesized under the control of
transcription-regulating sequences (TRSs) preceding each gene, with
sequences located at the 3’ end of the leader within the
genomic RNA (21, 23, 35).

Coronavirus nucleocapsid protein (N protein) has a struc- 
tural role and is involved in RNA synthesis (1, 6). The N 
proteins from different coronaviruses vary in length and pri- 
mary sequence. Nevertheless, some motifs with functional rel- 
evance are conserved, and N proteins share a three-domain 
organization according to sequence similarity (18). Recently, 
based on disorder predictions, a modular organization includ- 
ing two structured domains separated by a long disordered 
region was proposed for coronavirus N protein (2, 36). Using 
transmissible gastroenteritis virus (TGEV) as a model, RNA 
chaperone activity has been demonstrated for N protein (36). 
RNA chaperone activity was also reported for severe acute
respiratory syndrome coronavirus (SARS-CoV) N protein, and it was
postulated as a general activity of all coronavirus nucleocapsid 
proteins (36). 

The template switch during discontinuous RNA synthesis in 
coronavirus transcription is a complex process that includes 
several steps: the slowing down and stopping of nucleic acid 
synthesis, the template switch itself, reassociation of the nas- 
cent nucleic acid strand with the acceptor sequence, and elon- 
gation of the nascent strand using the acceptor nucleic acid as 
a template (8). To accomplish some of these steps, we have 
proposed that RNA chaperones, such as TGEV N protein, 
could decrease the energy barrier needed to dissociate the 
nascent minus RNA chain from the genomic RNA template 
(36). This decrease in the energy threshold would facilitate the 
nascent RNA chain template switch to hybridize with the TRS 
of the leader sequence during discontinuous transcription. 

The role of coronavirus N protein in RNA synthesis has 
been controversial. On one hand, it has been reported that 
viral replicase gene products suffice for coronavirus replication 
(30). On the other hand, the same authors and others have reported
that addition of N protein enhances RNA synthesis (1, 22, 31).
Whether this enhancement is due to an increase in replication, 
transcription, or both has not been clearly established. 

In this article, we report the generation of a set of TGEV 
N-protein deletion mutants and the analysis of their RNA 
chaperone activity, using annealing assays. The role of wild- 
type (wt) N protein and deletion mutants in template switching 
was evaluated in vitro using a retrovirus-derived strand transfer 
system. Our data indicated that the central disordered domain 
of the TGEV N protein has RNA chaperone activity and 
facilitates template switching in vitro. Moreover, the role of N 
protein in vivo in TGEV replication and transcription was 
evaluated. Interestingly, the results reported in this paper in- 
dicate that N protein is not essential for TGEV RNA replica- 
tion. In contrast, it is required for efficient transcription. These 
data reinforce the hypothesis of N protein having a role in 
template switching. 

Nucleic acid binding of N protein deletion mutants. To 
locate a TGEV N protein domain with nucleic acid chaperone 
activity, a set of deletion mutants was designed according to 
several criteria: predicted domain distribution, order-disorder 
pattern, predicted location of crystallized N-terminal (NTD) 
and C-terminal (CTD) domains from other coronavirus N pro- 
teins, and predicted RNA-binding residues (Fig. 1A). The cor- 
responding fragments of TGEV N-protein coding sequence 
were amplified by PCR and cloned into the plasmid pGEX-4T-2.
The N-protein mutants ΔN1, ΔN2, ΔN3, ΔN4, and ΔN5
(Fig. 1A) were successfully expressed and purified from
Escherichia coli as previously described (36).

RNA chaperone activity requires nucleic acid binding, and 
the ability of the constructed N protein deletion mutants to 
bind different nucleic acid species was evaluated by Northwest- 
ern blotting. Purified protein fractions were separated in 10% 
SDS-PAGE gels and transferred to nitrocellulose membranes. 
Blocking and incubation with biotin-labeled probes was per- 
formed as previously described (26). Three types of probes 
were used: 16-nucleotide (nt) single-stranded RNAs of both 
viral (leader TRS) and cellular (glyceraldehyde 3-phosphate 
dehydrogenase) origins, 18-nt single-stranded DNAs (ssDNAs), 
and longer ssRNAs of around 130 nt (acceptor and donor 
RNAs from the retrovirus-derived template switch system). 
Probe sequences are available from the authors upon request. 
Biotinylated RNAs were detected using the BrightStar BioDetect 
kit (Ambion) (Fig. 1B). The intensity of the binding was esti- 
mated by band densitometry. Since full-length TGEV N pro- 
tein and its deletion mutants were prone to degradation, a 
property related to the disordered nature of the protein (16, 
27, 34), binding values were calculated with reference to the 
amount of each intact protein fragment (Fig. 1B, top panel).
As expected, the glutathione S-transferase (GST) control protein
did not bind to any nucleic acid probe, while N protein
bound to all nucleic acid species (Fig. 1C). N-protein mutants
lacking the C-terminal domain (ΔN1 and ΔN2) and mutant
ΔN3, containing just the central disordered domain of N protein,
bound to all nucleic acid probes as well as or even better than
N protein. The mutant ΔN4 hardly bound short nucleic acids,
but it bound long ssRNAs as well as N protein. Finally, the
mutant ΔN5, containing just the C-terminal domain of N protein,
did not bind any nucleic acid (Fig. 1C). These results are
in agreement with previously reported RNA binding domains 
for other coronavirus N proteins (2, 3, 9, 13, 17, 20, 24-27). 
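
The normalization described above (probe signal corrected by the amount of intact protein, then expressed relative to full-length N protein as 100 r.u.) can be written as a short calculation. The following Python sketch is only an illustration: the function names and all numeric values are hypothetical, and the use of the sample standard deviation is an assumption, since the text does not state which estimator was used.

```python
from statistics import mean, stdev


def relative_binding(probe_signal, intact_protein, wt_probe_signal, wt_intact_protein):
    """Binding in relative units (r.u.), with full-length N protein set to 100 r.u.

    Each densitometry signal is first divided by the amount of intact
    (nondegraded) protein in the same lane, then scaled to the wt value.
    """
    if intact_protein <= 0 or wt_intact_protein <= 0:
        raise ValueError("intact protein amounts must be positive")
    corrected = probe_signal / intact_protein
    wt_corrected = wt_probe_signal / wt_intact_protein
    if wt_corrected <= 0:
        raise ValueError("wt reference signal must be positive")
    return 100.0 * corrected / wt_corrected


def summarize(replicates):
    """Mean and sample standard deviation (assumed estimator) of replicate values."""
    return mean(replicates), stdev(replicates)


# Hypothetical densitometry readings (arbitrary units), not data from the paper.
mutant_ru = relative_binding(probe_signal=50.0, intact_protein=0.5,
                             wt_probe_signal=100.0, wt_intact_protein=1.0)
avg, sd = summarize([90.0, 110.0, 100.0, 100.0])
```

In this made-up case the mutant gives half the raw probe signal of wt N protein but from half the intact protein, so its corrected binding equals the wt reference.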

RNA chaperone activity of N-protein deletion mutants de- 
termined by nucleic acid annealing assays. Facilitation of 
rapid and accurate nucleic acid annealing is one of the prop- 
erties of RNA chaperones in general and coronavirus N pro- 
tein in particular (36). Both DNA and RNA annealing assays, 
followed by proteinase K treatment to rule out nucleic acid 
aggregation mediated by protein binding, were performed as 
previously described (36). In the absence of protein or in the 
presence of GST, background levels of double-stranded DNA 
(dsDNA) were obtained (Fig. 2A). The amount of dsDNA was 
increased in the presence of full-length N protein, as expected. 
The mutants ΔN1 and ΔN2 also increased the formation of
dsDNA. Incubation with the mutant ΔN3, containing the central
disordered domain of N protein, showed the largest increase
in dsDNA levels. In contrast, incubation with the mutants
ΔN4 and ΔN5 provided background levels of dsDNA
(Fig. 2A). Similar results were obtained when the base pairing
of viral TRS RNAs was analyzed (Fig. 2B). Altogether, these
data suggested that the mutants ΔN1, ΔN2, and ΔN3 may have
RNA chaperone activity.

Role of N protein in in vitro template switching. An in vitro 
template switch system based on human immunodeficiency 
virus (HIV) was used to evaluate the involvement of N protein 
in template switching (11). Using a labeled DNA primer, HIV
reverse transcriptase (RT) synthesizes a cDNA copy of a donor
RNA of 131 nt (strong-stop DNA [SSDNA]). In the presence
of an RNA chaperone, template switching occurs, and a labeled
transfer product of 185 nt is obtained (Fig. 3A). This
system has strong RNA chaperone activity requirements in 
different steps, such as the unwinding of stable nucleic acid 
secondary structures, inhibition of self-priming, enhancement 
of template switching, an increase in RNase H activity, and 
improvement in RT processivity (15). Template switch reactions
were performed as previously described (11), except for
the use of a biotin-labeled DNA primer and detection of
biotinylated reaction products with the BrightStar BioDetect kit.

FIG. 1. Nucleic acid binding of N protein mutants. (A) Scheme of TGEV full-length N protein (wt, upper bar) and deletion mutants (ΔN, lower
bars). Disorder-order pattern, residues corresponding to crystallized amino- and carboxy-terminal domains (NTD and CTD, respectively), and
prediction of RNA binding residues according to the RNAbindR server (28, 29) are indicated. Labels on the left indicate the protein name. SR,
Ser-rich domain; RRM, RNA recognition motif; NLS, nuclear localization signal; Ac, acidic domain. The numbers indicate the corresponding
amino acids in N protein. (B) Western blot of protein fractions (upper panel) and Northwestern blot using different biotin-labeled probes (lower
panels). Proteins were detected with an antibody specific for the GST tag (α-GST). Nondegraded protein species are indicated by black
arrowheads. The nature of the nucleic acid probe is indicated at the right of the figure. The numbers on the left indicate molecular masses in kDa.
(C) The intensity of the bands binding the probe was estimated by densitometry and corrected by the amount of intact protein in each case. Binding
of N protein was considered to be 100 relative units (r.u.) of activity in each case; this threshold is indicated by the black dashed line. Error bars
represent the standard deviations from four independent experiments.

FIG. 2. Nucleic acid annealing assays. (A) DNA annealing assays
were performed using an 18-mer biotin-labeled DNA oligonucleotide
(Oligo C) and a 56-mer unlabeled DNA oligonucleotide (Oligo B) that
under reaction conditions forms a stable secondary structure that must
be unwound to allow double-stranded DNA formation. The ratio of
dsDNA to ssDNA was estimated by densitometry (graph) and referenced
to that obtained in the presence of wt N protein, considered 100
relative units (r.u.) (indicated by the black dashed line). (B) RNA
annealing assays were performed using a biotin-labeled 16-mer RNA
oligonucleotide representing the transcription regulating sequence of
the leader (TRS-L) and an unlabeled 16-mer RNA oligonucleotide
with the complementary sequence to the gene 7 TRS (cTRS-7). The
ratio of dsRNA to ssRNA was estimated as for panel A. Error bars in
panels A and B represent the standard deviations for results from four
independent experiments.

In the presence of increasing amounts of HIV nucleocapsid 
protein (NCp7), a well-known RNA chaperone, an increase in 
the formation of transfer product was observed (Fig. 3B). In 
contrast, the levels of transfer product remained constant 
in the presence of the GST negative control protein (Fig. 3B). 
The generation of transfer product as the result of template 
switching was monitored in the presence of full-length N pro- 
tein and its deletion mutants (Fig. 3B). Template switch effi- 
ciency was estimated by densitometry as previously described 
(11) (Fig. 3C). An increase in the TGEV N protein concen- 
tration led to an increase in the amounts of transfer product 
(Fig. 3C). This result indicated that TGEV N protein acts as an 
RNA chaperone in the HIV-derived system, facilitating tem- 
plate switching in vitro. Incubation with N-protein mutants 
lacking the C terminus (ΔN1 and ΔN2) and the mutant ΔN3,
containing the central disordered domain of N protein, also
produced an increase in the levels of transfer product (Fig. 3C). In
contrast, the mutants ΔN4 and ΔN5 did not produce an
increase in the amounts of transfer product (Fig. 3C). It is worth
noting that at the higher concentrations analyzed, wt N protein
and mutants ΔN2 and ΔN3 led to a decrease in the amount of
transfer product (Fig. 3C). This effect is probably a conse- 
quence of the RNA chaperone activity of the proteins, enhanc- 
ing the RNA degradation due to nucleases present in the 
protein fractions (15, 36). 
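
The template switch efficiency plotted in Fig. 3C is defined in the figure legend as the percentage T/(T + SSDNA), where T and SSDNA are the densitometry signals of the transfer product and strong-stop DNA bands. A minimal Python sketch of that ratio follows; the function name and the band intensities are hypothetical and serve only to illustrate the arithmetic.

```python
def transfer_efficiency(t_band, ssdna_band):
    """Percentage of transfer product: 100 * T / (T + SSDNA).

    t_band and ssdna_band are densitometry signals (arbitrary units) of the
    transfer product (T) and strong-stop DNA (SSDNA) bands from one lane.
    """
    if t_band < 0 or ssdna_band < 0:
        raise ValueError("band intensities must be non-negative")
    total = t_band + ssdna_band
    if total == 0:
        raise ValueError("no signal in either band")
    return 100.0 * t_band / total


# Hypothetical lanes from a protein titration (arbitrary units, not real data).
lanes = [(0.0, 10.0), (25.0, 75.0), (60.0, 40.0)]
efficiencies = [transfer_efficiency(t, ss) for t, ss in lanes]
```

Because the ratio is computed within each lane, it is insensitive to differences in total loading between lanes. It does not by itself correct for signal lost to nuclease degradation of the kind discussed above.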

The purification of in vitro active coronavirus replication- 
transcription complexes was recently described (33). Unfortu- 
nately, these systems cannot be used to analyze the role of 
chaperones in template switching, since their chemical compo- 
nents are not fully defined. Most importantly, it is likely that 
these systems will already include the N protein with the RNA 
chaperone activity that we would like to evaluate, in addition to 
other cellular proteins that also have RNA chaperone activity. 

Role of N protein in in vivo coronavirus RNA synthesis. 
Using TGEV-derived replicons, our group has previously 
reported the requirement of N protein for efficient RNA syn- 
thesis (1). To dissect the role of N protein in coronavirus 
replication and transcription, TGEV replicons containing
full-length N protein (N wt) or lacking N protein (ΔN) were used
(Fig. 4A). Their replication and transcription levels were ana- 
lyzed by quantitative RT-PCR using specific TaqMan assays 
for negative and positive strands, respectively. RNA levels 
were compared in all cases with the levels obtained from a 
nonreplicative replicon (NR) (Fig. 4A), leading to background 
reference levels of replication (Fig. 4B) and transcription (Fig. 
4C). In the absence of N protein provided in trans, genomic RNA levels of the
replicons including or lacking N protein were similar when
both negative and positive RNA strands were analyzed.
Furthermore, addition of N protein in trans led to an up to 10-fold
increase in replication levels for both replicons (Fig. 4B).
These data indicate that N protein is not essential for corona- 
virus replication, and they are in agreement with previous 
observations (30). The TGEV replicon containing full-length 
N protein transcribed very efficiently (Fig. 4C). In contrast, 
replicon ΔN, lacking N protein, led to basal levels of transcription
that were increased up to 100-fold by adding N protein in
trans (Fig. 4C). These data indicate that although not absolutely
essential, N protein is required for efficient coronavirus
transcription. The obtained results confirm previous data on
the enhancement of RNA synthesis by N protein (1, 31), indicating
that the increase in RNA synthesis from the presence of
N protein is due mainly to an increase in transcription levels.

FIG. 3. In vitro template switching assay. (A) HIV-derived template switching system. Using a biotin-labeled DNA primer, HIV RT synthesizes
a cDNA copy of a donor RNA of 131 nt (strong-stop DNA [SSDNA]). In the presence of an RNA chaperone, template switching occurs and a
labeled transfer product (T) of 185 nt is obtained. R, repeated sequences, including transactivation response (TAR) element sequence.
(B) Template switch assay in the presence of control NCp7 and GST proteins or TGEV N protein deletion mutants. (C) Template switch efficiency
was estimated by densitometry of labeled T and SSDNA bands, and the transfer product amount was calculated as a percentage of the ratio T/(T +
SSDNA) in each case. Error bars represent the standard deviations from four independent experiments.

Overall, the results shown indicate that the RNA chaper- 
one activity of TGEV N protein is located in its central 
disordered domain (aa 117 to 268) and that this activity 
requires the ability to bind RNA and additional characteristics
present in the ΔN3 N-protein fragment. To our knowledge,
this is the first demonstration of a nonretroviral RNA
chaperone acting in an in vitro template switching system.
The results obtained confirm the RNA chaperone activity of
TGEV N protein and its potential role in template switching.
In agreement with this, in vivo results indicated that N
protein is required for efficient transcription. Nevertheless,
additional experimental evidence will be required to definitively
establish the implication of N protein in coronavirus
template switching.

FIG. 4. In vivo coronavirus RNA synthesis. (A) Scheme of TGEV-derived replicons, containing full-length N protein (N wt) or lacking N
protein (ΔN). A nonreplicative replicon (NR) containing full-length N protein, but unable to replicate due to a mutation affecting several replicase
genes, was also used. All replicons contain gene 7 to allow measurement of transcription (1). (B) Quantification of negative and positive strands
of genomic RNA (gRNA) using specific TaqMan assays. Experiments were performed as previously described (1), including a DNase I treatment
to eliminate DNA from transfection. BHK-pSIN, BHK cells transfected with Sindbis virus replicon; BHK-pSIN-N, BHK cells expressing N protein
from Sindbis virus replicon. (C) Transcription levels measured by quantification of subgenomic mRNA of gene 7, both negative and positive strand,
using specific TaqMan assays. Error bars represent the standard deviations from five independent experiments.